Corpus: isl_wikipedia_2018_30K

5.2.18 Words nearly always together in sentences

Strong sentence co-occurrences with a low probability of being separated

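The quotient below is calculated as freq(word1)*freq(word2)/together_freq^2. A value of 1.00 means the two words never occur in a sentence without each other; larger values mean they are separated more often.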
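As a minimal sketch of this calculation, the Python function below computes the quotient from the three frequency counts. The function name `separation_quotient` and its parameter names are illustrative assumptions, not part of the corpus tooling; the input values are taken from the table rows for "Los Angeles" and "Investigation Scene".

```python
def separation_quotient(freq1: int, freq2: int, together: int) -> float:
    """Return freq1 * freq2 / together^2 (hypothetical helper name).

    freq1, freq2: sentence frequencies of the two words.
    together: number of sentences that contain both words.
    """
    if together <= 0:
        raise ValueError("together frequency must be positive")
    return freq1 * freq2 / together ** 2


# "Los Angeles": 23 * 20 / 20^2
print(round(separation_quotient(23, 20, 20), 2))

# "Investigation Scene": 7 * 7 / 7^2, the two words always appear together
print(round(separation_quotient(7, 7, 7), 2))
```

Because the joint frequency can never exceed either individual frequency, the quotient is always at least 1, and it equals 1 only when all three counts are identical.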

Word 1 Word 2 Frequency of word 1 Frequency of word 2 Frequency together Quotient
SG hljómplötum 27 27 26 1.08
hljómplötum SG 27 27 26 1.08
Los Angeles 23 20 20 1.15
Angeles Los 20 23 20 1.15
tónum Íslenzkum 20 20 19 1.11
Íslenzkum tónum 20 20 19 1.11
Primula lykla 16 13 13 1.23
lykla Primula 13 16 13 1.23
Las Vegas 11 10 9 1.36
Vegas Las 10 11 9 1.36
Critics Film 9 8 8 1.13
gregoríska júlíska 9 8 8 1.13
Crime Investigation 8 7 7 1.14
Crime Scene 8 7 7 1.14
Film Critics 8 9 8 1.13
júlíska gregoríska 8 9 8 1.13
Investigation Crime 7 8 7 1.14
Investigation Scene 7 7 7 1.00
Scene Crime 7 8 7 1.14
Scene Investigation 7 7 7 1.00